Probabilistic Linear Discriminant Analysis for Acoustic Modelling

Author

Liang Lu

Abstract

In this letter, we propose a new acoustic modelling approach for automatic speech recognition based on probabilistic linear discriminant analysis (PLDA), which is used to model the state density function of standard hidden Markov models (HMMs). Unlike conventional Gaussian mixture models (GMMs), in which correlations are only weakly modelled because diagonal covariance matrices are used, PLDA captures the correlations of the feature vector in subspaces without vastly expanding the model. It also allows high-dimensional feature input, and is therefore more flexible in making use of different types of acoustic features. We performed preliminary experiments on the Switchboard corpus and demonstrated the feasibility of this acoustic model.
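The contrast the abstract draws between diagonal covariances and subspace modelling can be illustrated with a small sketch. This is not the model from the letter, whose exact formulation is not given on this page. It assumes a generic factor-analysis-style covariance (a low-rank term plus a diagonal), and the names `dim`, `rank`, `loading`, and `noise_var`, as well as the sizes 39 and 5, are hypothetical choices for illustration.

```python
import numpy as np


def covariance_param_counts(dim, rank):
    """Stored covariance parameters for three Gaussian parameterisations."""
    return {
        "diagonal": dim,                          # variances only, no correlations
        "full": dim * (dim + 1) // 2,             # every pairwise covariance
        "low_rank_plus_diagonal": dim * rank + dim,  # loading matrix + diagonal noise
    }


def low_rank_plus_diagonal_cov(loading, noise_var):
    """Covariance of the form L L^T + diag(noise_var) (assumed form, not the letter's)."""
    return loading @ loading.T + np.diag(noise_var)


rng = np.random.default_rng(0)
dim, rank = 39, 5  # illustrative sizes, not taken from the letter
loading = rng.standard_normal((dim, rank))
noise_var = np.ones(dim)

counts = covariance_param_counts(dim, rank)
cov = low_rank_plus_diagonal_cov(loading, noise_var)

# The off-diagonal entries are what a diagonal covariance cannot represent.
off_diag = cov - np.diag(np.diag(cov))
has_correlations = bool(np.abs(off_diag).max() > 0)

print(counts)
print("models cross-dimension correlations:", has_correlations)
```

With these sizes, the low-rank form stores far fewer parameters than a full covariance while still producing non-zero covariances between feature dimensions, which is the general trade-off the abstract points to.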

Similar resources

Discrimination of Golab apple storage time using acoustic impulse response and LDA and QDA discriminant analysis techniques

Firmness is one of the most important quality indicators for apple fruits, which is highly correlated with the storage time. The acoustic impulse response technique is one of the most commonly used nondestructive detection methods for evaluating apple firmness. This paper presents a non-destructive method for classification of Iranian apple (Malus domestica Borkh. cv. Golab) according...

Feature-space speaker adaptation for probabilistic linear discriminant analysis acoustic models

Probabilistic linear discriminant analysis (PLDA) acoustic models extend Gaussian mixture models by factorizing the acoustic variability using state-dependent and observation-dependent variables. This enables the use of higher dimensional acoustic features, and the capture of intra-frame feature correlations. In this paper, we investigate the estimation of speaker adaptive feature-space (constra...

Probabilistic linear discriminant analysis with bottleneck features for speech recognition

We have recently proposed a new acoustic model based on probabilistic linear discriminant analysis (PLDA), which offers the flexibility of using higher dimensional acoustic features and is better able to capture intra-frame feature correlations. In this paper, we investigate the use of bottleneck features obtained from a deep neural network (DNN) for the PLDA-based acoustic model. Experime...

A probabilistic model of integration of acoustic cues in FV syllables

The interaction of consonantal and vocalic segments in FV syllables regarding identification of place of articulation of fricatives has been studied. A probabilistic model for integration of acoustic information in both segments is proposed. The model weights each segment’s contribution and integrates them in order to resemble listeners’ perception. First, the perceptual validity of the model h...

UTD-CRSS Systems for 2012 NIST Speaker Recognition Evaluation The CRSS SRE Team

This document briefly describes the systems submitted by the Center for Robust Speech Systems (CRSS) from The University of Texas at Dallas (UTD) for the 2012 NIST Speaker Recognition Evaluation. We developed a state-of-the-art i-vector based speaker recognition system [1]. Probabilistic linear discriminant analysis (PLDA) [2] along with several other backends are used for channel/noise compens...

Publication date: 2014
